Yuxin Zhang

Tony

SIDeR: Semantic Identity Decoupling for Unrestricted Face Privacy

Feb 04, 2026
Via arXiv

KTV: Keyframes and Key Tokens Selection for Efficient Training-Free Video LLMs

Feb 03, 2026
Via arXiv

Out of the Memory Barrier: A Highly Memory Efficient Training System for LLMs with Million-Token Contexts

Feb 02, 2026
Via arXiv

FSVideo: Fast Speed Video Diffusion Model in a Highly-Compressed Latent Space

Feb 02, 2026
Via arXiv

Beyond Pixels: Visual Metaphor Transfer via Schema-Driven Agentic Reasoning

Feb 01, 2026
Via arXiv

TAG-MoE: Task-Aware Gating for Unified Generative Mixture-of-Experts

Jan 12, 2026
Via arXiv

GEN3D: Generating Domain-Free 3D Scenes from a Single Image

Nov 18, 2025
Via arXiv

Test-Time Iterative Error Correction for Efficient Diffusion Models

Nov 09, 2025
Via arXiv

Step-Audio-EditX Technical Report

Nov 05, 2025
Via arXiv

Spotlight Attention: Towards Efficient LLM Generation via Non-linear Hashing-based KV Cache Retrieval

Aug 27, 2025
Via arXiv